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Abstract 

We consider structural credit modeling in the important special case where the 
log-leverage ratio of the firm is a time-changed Brownian motion (TCBM) with the 
time-change taken to be an independent increasing process. Following the approach 
of Black and Cox, one defines the time of default to be the first passage time for 
the log-leverage ratio to cross the level zero. Rather than adopt the classical notion 
of first passage, with its associated numerical challenges, we accept an alternative 
notion applicable for TCBMs called "first passage of the second kind" . We demon- 
strate how statistical inference can be efficiently implemented in this new class of 
models. This allows us to compare the performance of two versions of TCBMs, 
the variance gamma (VG) model and the exponential jump model (EXP), to the 
Black-Cox model. When applied to a 4.5 year long data set of weekly credit default 
swap (CDS) quotes for Ford Motor Co, the conclusion is that the two TCBM mod- 
els, with essentially one extra parameter, can significantly outperform the classic 
Black-Cox model. 
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1 Introduction 



Next to the Merton credit model of 1974 [12J, the Black-Cox (BC) model [2] is perhaps 
the best known structural credit model. It models the time of a firm's default as the 
first passage time for the firm's log-leverage process, treated as an arithmetic Brownian 
motion, to cross zero. The BC model is conceptually appealing, but its shortcomings, 
such as the rigidity of credit spread curves, the counterfactual behaviour of the short end 
of the credit spread curve and the difficulty of computing correlated multifirm defaults, 
have been amply discussed elsewhere, see e.g. [SJ. Indeed remediation of these different 
flaws has been the impetus for many of the subsequent developments in credit risk. 

One core mathematical difficulty that has hampered widespread implementation of 
Black-Cox style first passage models has been the computation of first passage distribu- 
tions for a richer class of processes one might want to use in modeling the log-leverage 
process. This difficulty was circumvented in [5J, enabling us to explore the consequences 
of using processes that lead to a variety of desirable features: more realistic credit spreads, 
the possibility of strong contagion effects, and "volatility clustering" effects. [5J proposed 
a structural credit modeling framework where the log-leverage ratio X t := log(Vt/K(t)), 
where V t denotes the firm asset value process and K(t) is a deterministic default threshold, 
is a time-changed Brownian motion (TCBM). The time of default is the first passage time 
of the log-leverage ratio across zero. In that paper, the time change was quite general: 
our goal in the present paper is to make a thorough investigation of two simple specifi- 
cations in which the time change is of Levy type that lead to models that incorporate 
specific desirable characteristics. We focus here on a single company, Ford Motor Co., 
and show that with careful parameter estimation, TCBM models can do a very good job 
of explaining the observed dynamics of credit spreads. TCBMs have been used in other 
credit risk models, for example p3], pi], p] and [TT] . 

One model we study is an adaptation of the variance gamma (VG) model introduced 
by [10] in the study of equity derivatives, and remaining very popular since then. We 
will see that this infinite activity pure jump exponential Levy model adapts easily to 
the structural credit context, and that the extra degrees of freedom it allows over and 
above the rigid structure of geometric Brownian motion correspond to desirable features 
of observed credit spread curves. The other model, the exponential (EXP) model, is a 
variation of the Kou-Wang double exponential jump model [7]. Like the VG model it is 
an exponential Levy model, but now with a finite activity exponential jump distribution. 
We find that the EXP model performs remarkably similarly to the VG model when fit to 
our dataset. 

We apply these two prototypical structural credit models to a dataset, divided into 3 
successive 18 month periods, that consists of weekly quotes of credit default swap spreads 
(CDS) on Ford Motor Company. On each date, seven maturities are quoted: 1, 2, 3, 
4, 5, 7, and 10 years. The main advantages of CDS data over more traditional debt 
instruments such as coupon bonds are their greater price transparency, greater liquidity, 
their standardized structure, and the fact that they are usually quoted for more maturities. 

Our paper presents a complete and consistent statistical inference methodology applied 
to this time series of credit data, one that takes full advantage of the fast Fourier transform 
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to speed up the large number of pricing formula evaluations. In our method, the model 
parameters are taken as constants to be estimated for each 18 month time period: in 
contrast to "daily calibration" methods, only the natural dynamical variables, not the 
parameters, are allowed to be time varying. 

Section 2 of this paper summarizes the financial case history of Ford Motor Co. over 
the global credit crisis period. Section 3 reviews the TCBM credit modeling framework 
introduced in [5]. There we include the main formulas for default probability distribu- 
tions, defaultable bond prices and CDS spreads. Each such formula is an explicit Fourier 
transform representation that will be important for achieving a fast algorithm. Section 4 
gives the detailed specification of the two TCBM models under study. Section 5 outlines 
how numerical integration of the default probability formula can be cast in terms of the 
fast Fourier transform. The main theoretical innovation of the paper is the statistical 
inference method unveiled in section 6. In this section, we argue that the naive mea- 
surement equation is problematic due to nonlinearities in the pricing formula, and that 
an alternative measurement equation is more appropriate. We claim that the resultant 
inference scheme exhibits more stable and faster performance than the naive method. In 
Section 7, we outline an approximate numerical scheme that implements the ideal filter 
of Section 6. The detailed results of the estimation to the Ford dataset are summarized 
in Section 8. 

2 Ford: The Test Dataset 

We chose to study the credit history of Ford Motor Co. over the 4.5 year period from 
January 2006 to June 2010. The case history of Ford over this period spanning the global 
credit crisis represents the story of a major firm and its near default, and is thus full of 
financial interest. We have also studied the credit data for a variety of other types of firm 
over this period, and achieved quite similar parameter estimation results. Thus our study 
of Ford truly exemplifies the capabilities of our modeling and estimation framework. 

We divided the period of interest into three nonoverlapping successive 78 week inter- 
vals, one immediately prior to the 2007-2008 credit crisis, another starting at the outset 
of the crisis, the third connecting the crisis and its early recovery. We used Ford CDS 
and US Treasury yield data, taking only Wednesday quotes in order to remove weekday 
effects. 

1. Dataset 1 consisted of Wednesday midquote CDS swap spreads CDS mj r and their 
bid-ask spreads u> m ,r on dates t m = m/52, m — 1, . . . , M for maturities T G T := 
{1, 2, 3, 4, 5, 7, 10} years for Ford Motor Co., for the M = 78 consecutive Wednesdays 
from January 4th, 2006 to June 27, 2007, made available from Bloomberg. 

2. Dataset 2 consisted of Wednesday midquote CDS swap spreads CDS mj r and their 
bid-ask spreads w m> T on dates t m = m/52,m = M + 1,...,2M for maturities 
T G T : = {1,2,3,4,5,7,10} years for Ford Motor Co., for the M = 78 consecu- 
tive Wednesdays from July 11, 2007 to December 31, 2008, made available from 
Bloomberg. 
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3. Dataset 3 consisted of Wednesday midquote CDS swap spreads CDS mj r and their 
bid-ask spreads w m ^ on dates t m = m/52,m = 2M + 1, ...,3M for maturities 
T e T := {1,2,3,4,5,7,10} years for Ford Motor Co., for the M = 78 consec- 
utive Wednesdays from January 7th, 2009 to June 30, 2010, made available from 
Bloomberg. 

4. The US treasury dataselQ consisted of Wednesday yield curves (the "zero curve") 
on dates t m = m/52, m = 1, . . . , 3M, for maturities 

T et := {lm, 3m, 6m, ly, 2y, 3y, 5y, 7y, lOy, 20y, 30y} 
for the period January 4th, 2006 to June 30, 2010. 

We note that Ford Motor Company experienced a large number of credit rating changes 
during this four-and-a-half year period. The history of Standard & Poors (S & P) ratings 
is as follows: BB+ to BB- on January 5, 2006; BB- to B+ on June 28, 2006; B+ to B 
on September 19, 2006; B to B- on July 31, 2008; B- to CCC+ on November 20, 2008. 
The downgrades continued into 2009, with a move from CCC+ to CC on March 4, 2009 
and to SD ("structural default") on April 6, 2009. The latest news was good: on April 
13, 2009, S & P raised Ford's rating back to CCC, on November 3, 2009 to B-, and on 
August 2, 2010 to B+, the highest since the onset of the credit crisis. 

In hindsight we see that Ford never actually defaulted, although it came close. In 
the following estimation methodology, we consider the non-observation of default as an 
additional piece of information about the firm. 

3 The TCBM Credit Setup 

The time-changed Brownian motion credit framework of |5j starts with a filtered prob- 
ability space (Q, J 7 , J-" t , P), which is assumed to support a Brownian motion W and an 
independent increasing process G where the natural filtration F t contains o~{G u , W v : u < 
t, v < G t } and satisfies the "usual conditions". P is taken to be the physical probability 
measure. 

Assumptions 1. 1. The log-leverage ratio of the firm, X t := log(Vt/K(t)) := x + 
cWct + f3cr 2 G t is a TCBM with parameters x > 0, a > and (3. The time change G t 
is characterized by its Laplace exponent ijj(u,t) := — logE[e~ uGt ] which is assumed 
to be known explicitly and has average speed normalized to 1 by the condition 

lim r 1 dip(0,t)/du = 1. 

t— >oo 

2. The time of default of the firm is the first passage time of the second kind for the log- 
leverage ratio to hit zero (see the definition that follows). The recovery at default is 
modelled by the "recovery of treasury" mechanism^] with constant recovery fraction 

R e [0,1). 

1 Obtained from US Federal Reserve Bank, www.federalreserve.gov/datadownload 
2 See 0. 
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3. The family of default-free zero-coupon bond price processes {B t (T), < t < T < 00} 
is free of arbitrage and independent of the processes W and G. 

4. There is a probability measure Q, equivalent to P and called the risk-neutral mea- 
sure, under which all discounted asset price processes are assumed to be martingales. 
Under Q, the distribution of the time change G is unchanged while the Brownian 
motion W has constant driftj^ We may write X t = x + o~W2 t + ^Q^ 2 G t for some 
constant (3q where = W u + <r(/3 — (3q)u is driftless Brownian motion under Q. 

We recall the definitions from [5] of first passage times for a TCBM X t starting at a 
point X = x > to hit zero. 

Definition 2. • The standard definition of first passage time is the J 7 stopping time 

t w = M{t\X t < 0} . (1) 

The corresponding stopped TCBM is xjp = X tAt (i). Note that in general xj}^ < 0. 
• The first passage time of the second kind is the J 7 stopping time 

t (2) = inf{t|G t > £*} (2) 
where t* = inf{t|x + aW t + (3a 2 t < 0}. The corresponding stopped TCBM is 

X t (2) = x + aW GtAt . + (3a 2 (G t A £*) (3) 

(2) 

and we note that X\^ = 0. 

The general relation between and is studied in detail in [6] where it is shown 
how the probability distribution of can approximate that of t^ 1 '. For the remainder of 
this paper, however, we consider to be the definition of the time of default. 

The following proposition^ proved in [5] , is the basis for computing credit derivatives 
in the TCBM modeling framework. 

Proposition 3. Suppose the firm's log-leverage ratio X t is a TCBM with a > and that 
Assumptions 1 hold. 

1. For any t > 0,x > the risk-neutral survival probability P^ 2 \t,x) := E, x [lr t (2) >t y] is 
given by 
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__ e *w+r) Mdu + (1 _ e -^)l {/3>0} , (4) 



3 This assumption can be justified by a particular version of the Girsanov theorem. It would be natural 
to allow the distribution of G to be different under Q, but for simplicity we do not consider this possibility 
further here. 

4 Equation ^ given in [5] only deals with the case j3 < 0. The proof of the extension for all (3 is 
available by contacting the authors. 
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The density for X t conditioned on no default is 

p(y,t,x) := j-E x [l {Xt < yy \tW>t] (5) 

= P^ 2 \t, X^l^oy- / [ e Mv-*) _ e ^u(y + ^ e -^Hu* + fi)/2,t) du 

The characteristic function for X t conditioned on no default is 

E x [e ikXt \t^>t] = P( 2 )(t,z)-%[e ifcx *.l {t c 2)> , } ] (6) 
= P^(t,xr e — [ /a USi ^ X) e ^t^)AO dti 

J R (/3 + ^) 2 + M 2 
+ ^ite _ e -ite-2^) e -V(^(fe 2 -2i^)/2 lf ) Ql {/3=0} + l {/3>0} 

T/ie time price B RT (T) of a defaultable zero coupon bond with maturity T and 
recovery of treasury with a fixed fraction R is 

B RT (T) = B(T)[P^(T,x)) + R(l - P {2) (T,x))} (7) 

3. The fair swap rate for a CDS contract with maturity T = NAt, with premiums paid 
in arrears on dates tk = kAt, k = 1, . . . , N , and the default payment of (1 — R) paid 
at the end of the period when default occurs, is given by 

(1 " R) [Et^l " P {2) (t k ,x))[B(t k ) - B{t k+l )] + B(T)[1 - P( 2 )(T,x)]" 
CDS(x, T) = l - 



AtJ2ti p(2) (tk,x)B(t k ) 

(8) 

Remarks 4. • We shall be using the above formulas in both measures P and Q, as 
appropriate. 

• We observe in Q that the survival and default probabilities are invariant under the 
following joint rescaling of parameters 

(x, cr, /3) -> (Xz, Aa, A -1 /3), for any A > 0. (9) 

It follows that all pure credit derivative prices are invariant under this rescaling. 

4 Two TCBM Credit Models 

The two credit models we introduce here generalize the standard Black-Cox model that 
takes Xt = x + o~Wt + (5a 2 t. They are chosen to illustrate the flexibility inherent in our 
modeling approach. Many other specifications of the time change are certainly possible 
and remain to be studied in more detail. The following models are specified under the 
measure P: by Assumption [T] they have the same form under the risk-neutral measure Q, 
but with (3 replaced by (3n. 



6 



4.1 The Variance Gamma Model 

The VG credit model with its parameters 9 = (a, f3, b, c, R) arises by taking G to be a 
gamma process with drift defined by the characteristic triple (b, 0, u) with b G (0, 1) and 
jump measure u(z) = ce~ z ^ a /z 1 a > on (0, oo). The Laplace exponent of G t is 

^ VG (u,t) := - logE[e- uGt ] = t[bu + clog(l + ait)]. (10) 

and by choosing a = — the average speed of the time change is t~ 1 dijj VG (0,t)/du = 1. 
This model and the next both lead to a log-leverage process of Levy type, that is, a 
process with identical independent increments that are infinitely divisible. 



4.2 The Exponential Model 

The EXP credit model with its parameters 9 = (a, (3, b, c, R) arises taking by G to be 
a Levy process with a characteristic triple (6, 0, z/) with b G (0,1) and jump measure 
u(z) = ce~ z / a /a,a > on (0, oo). The Laplace exponent of G t is 

il> Exp (u,t) := -\ogE[e- uGt ] =t 

and by choosing a = — the average speed of the time change is t~ 1 dijj VG (0,t)/du = 1. 



bu + 



acu 
1 + au 



5 Numerical Integration 

Statistical inference in these models requires a large number of evaluations of the integral 
formula (|4]) that must be done carefully to avoid dangerous errors and excessive costs. To 
this end, we approximate the integral by a discrete Fourier transform over the lattice 

T = { u (k) = -u + kr)\k = 0, 1, . . . , N - 1} 

for appropriate choices of N,i],u := Nrj/2. It is convenient to take N to be a power of 2 
and lattice spacing rj such that truncation of the ^-integrals to [— u, u) and discretization 
leads to an acceptable error. If we choose initial values x to lie on the reciprocal lattice 
with spacing 77* = 27r/Ni] = ir/u 

T* = { x (£) =£ri*\£ = Q,l,...,N -1} 

then the approximation is implementable as a fast Fourier transform (FFT): 

P {2) (t,x(i)) ~ -?5L (1 T R 2 ex P HVKfc) 2 + /3 2 )/2,t)] (11) 



K M (A;)2 + /3 

M(fc) 2 + /3 



A:=0 

AT-1 



1) ^ "W* exp[-^( ( x 2 ( M (fc) 2 + /3 2 )/2, t)] (12) 



fc=0 
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Note that we have used the fact that e -*Mj*(<)/2 = for all leZ. 

The selection of suitable values for N and 77 in the above FFT approximation of Q 
is determined via general error bounds proved in [§] . In rough terms, the pure truncation 
error, defined by taking 77 — > 0, N — >■ 00 keeping u = Nt]/2 fixed, can be made small 
if the integrand of pi) is small and decaying outside the square [—u, u) . Similarly, the 
pure discretization error, defined by taking u — > 00, N — > 00 while keeping 77 fixed, can be 
made small if e~^ x P^(x, t), or more simply e~'^' x , is small, where x := 71/77. One expects 
that the combined truncation and discretization error will be small if u and rj = tc/x are 
each chosen as above. These error bounds for the FFT are more powerful than bounds 
one finds for generic integration by the trapezoid rule, and constitute one big advantage 
of the FFT. A second important advantage to the FFT is its 0(N log N) computational 
efficiency that yields P^ on a lattice of x values with spacing 77* = 2n/Nr] = tt/u: this 
aspect will be very useful in estimation. These two advantages are offset by the problem 
that the FFT computes values for x only on a grid. 

We now discuss choices for N and 77 in our two TCBM models. For < 0, the survival 
function of the VG model is 



P (2) (0,t,x,/3) 



7T 



exp[-tba 2 (u 2 + f3 2 )/2] 1 + 



aa 2 (u 2 + (3 2 ) 



-ct 



u sin ux 



u 2 + fr 



du 



while for the EXP model 

-fix 



7T 



cxp 



-t lba 2 (u 2 + (3 2 )/2 + 



aca 2 (u 2 + (3 2 ) 
2 + aa 2 (u 2 + ft 2 ) 



u sin ux 



u 2 + F 



du 



In both models, the truncation error has an upper bound e when u > C|$ _1 (eC / )|, where 
is the inverse normal CDF and C, C are constants depending on t. On the other 
hand, provided < 0, the discretization error will be small (of order e or smaller) if 



N > 



2ttL 



fog! 



--1/ 



exp (-2/3x))). Errors for @ can be controlled similarly. 



6 The Statistical Method 

The primary aim of this exercise is to demonstrate that our two TCBM credit models can 
be successfully and efficiently implemented to fit market CDS data on a single firm, in 
this case Ford Motor Company, and to compare these models' performance to the original 
Black-Cox structural model. 

We were able to reduce the complexity of our models with negligible loss in accuracy 
by removing what appear to be two "nuisance parameters". First, we expect, and it was 
observed, that parameter estimations were not very sensitive to (3 near j3 = 0, so we 
arbitrarily set /3 = —0.5. Secondly, we observed insensitivity to the parameter b and a 
tendency for it to drift slowly to zero under maximum likelihood iteration: since b = is 
a singular limit, we set b = 0.2. Finally, in view of the rescaling invariance @, and the 
interpretation of a as the volatility of X, without loss of generality we set a = 0.3 in all 
models. So specified, the two TCBM models have three free parameters O = (c, (3q, R) 
as well as three frozen parameters a = 0.3, /3 = —0.5, b = 0.2. The Black-Cox model with 
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its free parameters = (/3q,R) and frozen parameters o = 0.3,(3 = —0.5 then nests as 
the c = limit inside both the VG and EXP models. 
We summarize the modeling ingredients: 

• an unobserved Markov process X t 6 M. d ; 

• model parameters 06flc K". We augment the vector — > to include an 
additional measurement error parameter rj; 

• model formulas F k (X, 0) for k = 1, . . . , K, which in our case are theoretical CDS 
spreads given by ^ for K = 7 different tenors; 

• a dataset consisting of spreads Y := {Y t } observed at times t = 1, . . . ,M where 
Y t = {Y t k } for a term structure of k — 1, . . . , K, plus their associated quoted bid/ask 
spreads w k . We use notation Y< t : = {Yi, . . . , Y t } and Y <t := {Y 1; . . . , Y t _{\ etc. 

Since we do not attempt to estimate an underlying interest rate model, we treat the 
US Treasury dataset as giving us exact information about the term structure of interest 
rates, and hence the discount factors entering into (18]). We treat the quoted bid/ask 
spreads w k as a proxy for measurement error: these will simplify our treatment of the 
measurement equation. We also treat the non-default status of Ford on each date as an 
additional observation. 

To complete the framework, an arbitrary Bayesian prior density of is taken 

p o (0) := 

with support on D C K" +1 . The statistical method appropriate to a problem like this 
is thus some variant of a nonlinear Kalman filter, combined with maximum likelihood 
parameter estimation. 

Based on these assumptions, it is rather natural to assume that observed credit spreads 
provide measurements of the hidden state vector X with independent gaussian errors. 
Moreover the measurement errors may be taken proportional to the observed bid/ask 
spread. Thus a natural measurement equation is 

Y t k = F k (X u &) + 7 1 w^ (13) 

where Q are independent standard gaussian random variables and rj is constant. In this 
case the full measurement density of Y would be 

f(y\x,&)= n n 

t=l,...,M k=l,...,K 

However, we observed an important deficiency that seems to arise in any scheme like 
this where the measurement equation involves a nonlinear function of an unobserved 
process X. This nonlinearity leads to nonconvexity in the log-likelihood function for X, 
which in turn can destabilize the parameter estimation procedure. For such reasons, we 
instead follow an alternative scheme that in our problem, and perhaps many others of this 



2ttt]w 



cxp 



(Y k -F k (X t ,Q)Y 



2rf{w 



k\2 



(14) 
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type, gives a great improvement in estimation efficiency. It works in our case because the 
model formula ^ for F k (x, G), although nonlinear in x, is monotonic and approximately 
linear in x. We will call our scheme the "linearized measurement" scheme and it is justified 
as follows. 

We define G k (Y, 0) to be the solution x of Y = F k (x, 6), and note that f k := d x F k > 
0. Then, provided rjw k are small enough, we may linearize the x dependence of the 
measurement equation using the Taylor expansion 

Y k -F k (x) = Y k - F k (G k {Y k )+x-G k {Y k )) 

« Y k - F k {G k {Y k )) + f k (G k {Y k ))(G k {Y k ) - x) 
= f k {G k (Y k )){G k (Y k )-x) 

This equation above justifies the following alternative to the measurement equation p3| ): 

X k = X t + V w k i k (15) 



Now £ k , k = 1,2, ... ,K,t = 1, 2, . . . , M are iid N(0, 1) random variables and the trans- 
formed measurements are 

X k = X k {Y k , 6) := G k {Y k , 0). 

Furthermore, 

w k = w k (x k ,Q) = f k (x k ,ey 1 w k . 

Note that X k , k = 1, . . . , K have the interpretation as independent direct measurements 
of the unobserved state value X t . 

The full measurement density of Y in our linearized measurement scheme is thus: 

7{Y\X,Q) := J] f(Y t \X t ,Q) (16) 

t=l,...,M 

f(Y t \x t ,e) := n 

k=l,...,K 

where we have recombined denominator factors of w k with Jacobian factors (f k )~ l - The 
multiperiod transition density conditioned on nondefault is 

V(X\B, no default) = ] [ p(X t \X t ^ 1 ,Q) (18) 

t=2,...,M 

where p(y\x, Q) is the one period conditional transition density given by ^ with t = At. 
Finally the full joint density for (X, Y,Q) is 

p(X, Y, 6) := T(Y\X, e)V(X\Q)p Q (e) (19) 



27tr]Wt 



exp 



(X k (Y k ,Q)-X t 
2r] 2 w^{Q) 2 



(17) 
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Integration over the hidden state variables X leads to the partial likelihood function, 
which can be defined through an iteration scheme: 

p(Y, 0) = J f(Y M \X M , Q)p(X M , Y <M , Q)dX M (20) 

where for t < M 

( fp(X t+1 \X t7 Q)f(Y t \X t ,Q)p(X t ,Y <t ,e)dX t , t>0 
p(X t+1 ,Y <t ,Q)= I (21) 

( Po(e) t = o 

The following summarizes statistical inference within the linearized measurement scheme. 

Statistical Inference using the Linearized Measurement Scheme: Let (Y,w) := 
{Y t ,w^} be the time series of CDS observations. 

1. Maximum Likelihood Inference: The maximum likelihood parameter estimates 
are the solutions of 

= argmax eeD log (p(Y, 0)/po(0)) (22) 
where p(Y, 0) is given by (20). The log-likelihood achieved by this solution is 

£:=log (p(F,0)/po(0^ 
and the Fisher information matrix is 



4 log (p(Y,e)/p (e) 



J:= - 

2. Filtered State Inference: The time series of filtered estimates of the state variables 
Xi, . . . , Xm are the solutions Xi, . . . , Xm of 

X t = argmax^^ log (f{Y t \x, 0)p(x, Y< t _ x , 0)J (23) 

7 Approximate Inference 

The previous discussion on inference was exact, but computationally infeasible. Our 
aim now is to give a natural and simple approximation scheme that will be effective for 
the problem at hand. Our scheme is to inductively approximate the likelihood function 



p(X t+ i, Y< t , 0) defined by (21) by a truncated normal distribution through matching of 
the first two moments. The truncation point of is determined by the no default condition. 
The rationale is that the non-gaussian nature of the transition density p will have only 
a small effect when combined with the gaussian measurement density /. We expect our 
approximation to be appropriate for a firm like Ford that spent a substantial period near 
default. As we discuss at the end of this section, a simpler approximation is available 
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that is applicable to a firm of high credit quality. The more complicated method we now 
describe is intended to be more robust when applied to firms of a range of credit qualities. 



We describe a single step of the inductive computation of p given by (21). We fix t, 
denote the time t state variable as x and the time t + 1 state variable as capital X. The 
length between t and t + 1 is denoted as At. We also suppress Y< t and 0. In this context, 
we are looking for p and a that satisfy 

P{X)~ \ X>0 (24) 

where 

poo 

m o — / f(x)p(x)dx. (25) 



JO 

Here <ft and $ are probability density and cumulative distribution functions of the standard 
normal distribution and p is carried over from the previous time step. The first two 
moments of the truncated normal distribution are straightforward to derive and are given 
here for completeness: 

m trunc = fx + (J X(a) 

m trunc = a 2 [l - 5{a)] + {m\ runc ) 2 

where a = — |,A(a) = iz^a)^( a ) — A (a) [A (a) — a]. Note that the truncated normal 
distribution has a larger mean and smaller variance than the original normal distribution. 
Using the Fubini theorem, the first two moments of the distribution p(X) are: 

POO 

m 1 = rriQ 1 / gi(x)f(x)p(x)dx (26) 
Jo 

POO 

m 2 = rriQ 1 / g 2 (x)f(x) p{x)dx 
Jo 

Here gi(x) and g 2 (x) are the first and second moments of X with respect to the transition 
density p(X\x) and are given using ^ by 

gi (x) = ^d k \ k=0 E^[e tkX \t^ > At] (27) 
g 2 (x) = -d 2 k \ k=0 E^[e lkX \t^ > At] 

Note that p(x) has a gaussian kernel approximation by induction and the measurement 
density f(x) is also gaussian. Their product gaussian kernel is then simply a scaled normal 
probability density function: 

. . . . mn fx — m\ . . 

m~p{x) = 6 — - (28) 

We also notice that the transition density p(X\x) with a short period At resembles a 
Dirac 5 function of X and fitting it to a polynomial would require very high order to 
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guarantee accuracy in a local domain. In our method, by contrast, the moment functions 



g±(x) and g2(x) that appear in the integrals in (26) are much smoother functions of x and 
usually low order polynomials can approximate them quite accurately in a local domain. 
Take a normal transition density for example: g\{x) is linear in x and g2{x) is quadratic 
in x. Their counterparts for time changed Brownian motion conditional on no default 
can also be well approximated by low order polynomials in a local domain. We stress the 
word "local" because the product gaussian kernel fp typically has a moderate variance v 



and relatively large mean rh: therefore the integrals in Equation (26) are dominated by a 
local domain [rh — ay/ii, rh + a\/iJ] with a safely taken to be 4. Thus we need to fit g\(x) 
and g2^x) over the interval [rh — ay/ii, rh + a\/il] which can be done quite accurately with 
quartic polynomials: 

9i(x) = Zt =0 c lk (x-m) k (29) 
g 2 (x) = S^ =0 c 2 fc(x - rh) k . 

Equation (26) is now approximated by 

te^ -,/=, [ ^t=o c ik(x-rh) k (f) ( a —^- J dx 

m \/v®{m/\/v) Jo \ yv J 

1 f°° „/i / a , fx — rh 



nil 



m 2 = jett~z , /=. / ^t=o c 2k(x - m) k (f) ( '" ) dx 

m v v<£[m/ \/v) Jo \ yv 



which can be evaluated analytically in terms of the error function. Matching ni\ and ni2 
with m t { unc and m t 2 runc determines p, and a and completes the iteration scheme for (20). 



Remarks 5. 



In our numerical examples, we enlarge the integral domain in Equation (30) from 
]R + to R if rh > 4\/iJ, which leads to a simpler implementation. It turns out in our 
study that this condition is satisfied for all sampling periods. 

An alternative moment matching approximation is possible which approximates 
p(X) by a regular normal distribution, rather than a truncated normal. Then 



the truncated density in Equation (24) should be replaced by the regular density 
(f) (— ^) ,1 £ R. Although this approximation conflicts with the default barrier, 
for a firm that is far from default this does not introduce a serious numerical error. 
Moreover, this approximation leads to linear gaussian transition density and is thus 
a Kalman filter. 

Here we summarize the computation of p(Y<M, ©) for a fixed value of O: 

1. Set pi = p (@); 

2. Compute the measurement density f(Yi\Xi) (i.e. compute its mean and variance: 
this step requires efficient use of the FFT to invert the CDS spread formula); 

3. For t = 1 : M - 1 
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(a) Approximate p(X t +i, Y< t , O) given by (21 ) by a truncated normal density with 
mean and variance computed by matching moments. For this one uses the 
exact formula for the first two moments of the conditional transition density 
rt5J), and the assumed normal form of f(Y t \X t ) and p(X t , Y <t , O); 

(b) Compute the measurement density f(Y t+ i\X t +i) (ie. compute its mean and 
variance, again with efficient use of FFT); 

(c) End loop; 



4. Finally compute p(Y< M ,Q) by integrating X M as in (20). 



8 Numerical Implementation 

From the considerations described in section [6] we fix f3 = —0.5, a = 0.3, b = 0.2. We 
choose u = 300 which controls the truncation error within 10~ 10 . Depending on G, we 
allowed the size of the FFT lattice, N, to vary from 2 s to 2 10 , keeping the discretization 
error within 10~ 10 . We use the Matlab function fmincon to implement the quasi-Newton 
method to maximize the likelihood function. Since fmincon also calculates the gradient 
and Hessian of the objective function, we also obtain standard errors of the parameter 
estimates. 

Table [T] summarizes the estimation results for each of the three models, for the three 
datasets in 2006-2010, using our time series approximate inference. Estimated parameter 
values are given with standard errors, as well as summary statistics for the resulting 
filtered time series of X t . We also present the root mean square error (RMSE) defined as 
the average error of the CDS spreads quoted in units of the bid/ask spread. 



RMSE = 

Overall, the finite activity EXP model shares quite a few similarities with the infinite 
activity VG model, both in behavior and performance. For these two TCBM models, 
their model parameters are quite similar between dataset 1 and dataset 3 respectively. It 
is consistent with Ford's history of credit ratings that dataset 3 has lower, more volatile 
log- leverage ratios and lower recovery rate than dataset 1. We can also see that during 
the peak of the credit crisis in dataset 2, the estimated parameters show noticeable signs 
of stress. The mean time change jump size is up by approximately 50%, driven mainly 
by the increased short term default probability. The recovery rate is significantly lower. 
In the very stressed financial environment at that time, a firm's value would be greatly 
discounted and its capacity to liquidate assets would be limited. On the other hand the 
risk neutral drift (3q is significantly higher, reflecting a certain positive expectation on the 
firm. At the peak of the credit crisis, Ford's annualized credit spreads exceeded 100%. 
The log- leverage ratios are much suppressed to a level of about 65% of that of dataset 1. 

By definition, RMSE measures the deviation of the observed CDS spreads from the 
model CDS spreads while r\ measures the deviation of the "observed" log-leverage ratios 



1 



M-K 



^^{F*(X t ,&)-Y t k Y 



t=\ k=i 
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Xt from the "true" log-leverage ratios X t . We can see that RMSE and rj are very close in 
all cases, which implies that the objective functions based on the naive CDS measurement 



density (14) and the linearized measurement density (16) are fundamentally very similar. 

In terms of RMSE and rj, both TCBM models performed much better than the Black- 
Cox model. The TCBM fitting is typically within two times the bid/ask spread across 
3 datasets, while the errors of the Black-Cox model are about 30% higher on average. 
Figure [T] shows that on three typical days, the TCBM models can fit the market CDS 
term structure curves reasonably well while the Black-Cox model, with its restrictive 
hump-shaped term structures, has difficulties for some tenors. To fit high short spreads, 
the log-leverage ratio is forced to unreasonably low levels. The TCBM models, with only 
one extra parameter than the Black-Cox model, generate more flexible shapes, and do a 
better job of fitting the data. 

Figure [2 displays histograms of the signed relative error (w k )~ l ( y F k {X t , 0) — Y k> ) for 
the three models, for the short and long end of the term structure. For both TCBM 
models we can see that most errors are bounded by ±2 and are without obvious bias. 
By comparison, the errors of the Black-Cox model are highly biased downward in the 
both the short and long terms. For 1-year spreads the majority of errors stay near -2 and 
for 10-year spreads there is a concentration of errors near -4. Surprisingly, all the three 
models perform better and more closely to one another during the crisis period of dataset 
2. For the TCBM models, the great majority of errors are near and without obvious 
bias. The Black-Cox model does not have obvious bias either, but there are more errors 
beyond the ±2 range. The performance of all three models is better for intermediate 
tenors between 1 and 10 years, with the mid-range 5-year and 7-year tenors having the 
best fit. The histograms for these tenors (not shown) do still indicate that the TCBM 
models perform better than the Black-Cox model, in regard to both bias and absolute 
error. 

The estimation results (not shown here) using the Kalman filter method described in 
Remarks [5] are very close to the results shown in Table [TJ indicating that the transition 
density can be safely approximated by a gaussian density. The Kalman filter is convenient 
for calculating the weekly likelihood function, which is needed in the Vuong test JTS], a 
test to compare the relative performance of nested models. If X t and Pt denote the ex-ante 
forecast and variance of time t values of the measurement series obtained from Kalman 
filtering, the weekly log-likelihood function can be written as 

l t = -\ log \P t \ ~\{X t - X t Y{P t )-\X t -X t )-J2 /*(*«*, 9). (30) 

k 

The log-likelihood ratio between two models i and j is 

M 



4=1 



and the Vuong test statistic is 



t. - 

I i 
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where sfj is the variance of {la — ljt}t=i,...,M- Vuong proved that %j is asymptotic to a 
standard normal under the null hypothesis that models % and j are equivalent in terms 
of likelihood function. Due to the serial correlation within the log-likelihood functions, 
Newey and West's estimator [H] is used for s. The Vuong test results are shown in Table 
[2] and confirm that the Black-Cox model is consistently outperformed by the two TCBM 
models. Moreover, by this test, the EXP model shows an appreciable improvement over 
the VG model that could not be easily observed in the previous comparison. 

It is interesting to compare the time series of Ford stock prices to the filtered log- 
leverage ratios X t . Fig [3] shows there is a strong correlation between these two quantities, 
indicating that the equity market and credit market are intrinsically connected. The 
empirical observations supporting this connection and thereafter financial modeling inter- 
preting this connection can be found in [11] , j3] and their references. 

Finally, we mention that a stable model estimation over a 78 week period typically 
involved about 120 evaluations of the function p(Y, 0), and took around one minute on a 
standard laptop. 

9 Conclusions 

In this paper, we demonstrated that the Black-Cox first passage model can be efficiently 
extended to a very broad class of firm value processes that includes exponential Levy 
processes. We tested the fit of two realizations of Levy subordinated Brownian motion 
models to observed CDS spreads for Ford Motor Co., a representative firm with an in- 
teresting credit history in recent years. We found that the two Levy process models can 
be implemented very easily, and give similarly good performance in spite of the very dif- 
ferent characteristics of their jump measures. With one extra parameter, both models 
outperform the Black-Cox model in fitting the time series of CDS term structures over 
1.5 year periods. However, they still have limitations in fitting all tenors of the CDS term 
structure, suggesting that further study is needed into models with more flexible time 
changes. 

We also proposed a new method for filtered statistical inference, based on what we 
call the linearized measurement equation. This new method inductively creates "quasi- 
gaussian" likelihood functions that can be approximated either as truncated gaussians, 
or as true gaussians in which case we are lead to a Kalman filter. By their strategic 
use of the fast Fourier transform, both of our two approximation methods turn out to 
be very efficient: parameter estimation for a time series of term structures for 78 weeks 
can be computed in about a minute. Finally, we observe a strong correlation between 
Ford's stock price and the filtered values of its unobserved log-leverage ratios. This final 
observation provides the motivation for our future research that will extend these TCBM 
credit models to TCBM models for the joint dynamics of credit and equity. 
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Table 1: Parameter estimation results and related statistics for the VG, EXP and Black- 
Cox models. X t derived from (23) provide the estimate of the hidden state variables. The 
numbers in the brackets are standard errors. The estimation uses weekly (Wednesday) 
CDS data from January 4th 2006 to June 30 2010. x s td is the square root of the annualized 
quadratic variation of X t . 
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Table 2: Results of the Vuong test for the three models, for dataset 1, dataset 2 and 
dataset 3. A positive value larger than 1.65 indicates that the row model is more accurate 
than the column model with 95% confidence level. 
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Figure 1: The in-sample fit of the two TCBM models and Black-Cox model to the observed 
Ford CDS term structure for November 22, 2006 (top), December 3, 2008 (middle) and 
February 24, 2010 (bottom). The error bars are centered at the mid-quote and indicate 
the size of the bid-ask spread. 
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Figure 2: Histograms of the relative errors, in units of bid-ask spread, of the in-sample fit 
for the VG model (blue bars), EXP model (green bars) and Black-Cox model (red bars) 
for dataset 1 (top), dataset 2 (middle) and dataset 3 (bottom). The tenor on the left is 
1-year and on the right, 10-year. 
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Figure 3: Filtered values of the unobserved log- leverage ratios X t versus stock price for 
Ford for dataset l(top), 2 (middle) and 3 (bottom). 
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